NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Generating in-distribution proxy graphs for explaining graph neural networks

Chen, Zhuomi; Zhang, Jiaxing; Ni, Jingchao; Li, Xiaoting; Bian, Yuchen; Islam, Md Mezbahul; Mondal, Ananda Mohan; Wei, Hua; Luo, Dongsheng (July 2025, ICML'24: Proceedings of the 41st International Conference on Machine Learning)

Free, publicly-accessible full text available July 4, 2026
Evaluating SHAP’s Robustness in Precision Medicine: Effect of Filtering and Normalization

https://doi.org/10.1109/BIBM58861.2023.10385704

Sobhan, Masrur; Mondal, Ananda Mohan (December 2023, IEEE)

Full Text Available
MOGAT: A Multi-Omics Integration Framework Using Graph Attention Networks for Cancer Subtype Prediction

https://doi.org/10.3390/ijms25052788

Tanvir, Raihanul Bari; Islam, Md Mezbahul; Sobhan, Masrur; Luo, Dongsheng; Mondal, Ananda Mohan (March 2024, International Journal of Molecular Sciences)

Accurate cancer subtype prediction is crucial for personalized medicine. Integrating multi-omics data represents a viable approach to comprehending the intricate pathophysiology of complex diseases like cancer. Conventional machine learning techniques are not ideal for analyzing the complex interrelationships among different categories of omics data. Numerous models have been suggested using graph-based learning to uncover veiled representations and network formations unique to distinct types of omics data to heighten predictions regarding cancers and characterize patients’ profiles, amongst other applications aimed at improving disease management in medical research. The existing graph-based state-of-the-art multi-omics integration approaches for cancer subtype prediction, MOGONET, and SUPREME, use a graph convolutional network (GCN), which fails to consider the level of importance of neighboring nodes on a particular node. To address this gap, we hypothesize that paying attention to each neighbor or providing appropriate weights to neighbors based on their importance might improve the cancer subtype prediction. The natural choice to determine the importance of each neighbor of a node in a graph is to explore the graph attention network (GAT). Here, we propose MOGAT, a novel multi-omics integration approach, leveraging GAT models that incorporate graph-based learning with an attention mechanism. MOGAT utilizes a multi-head attention mechanism to extract appropriate information for a specific sample by assigning unique attention coefficients to neighboring samples. Based on our knowledge, our group is the first to explore GAT in multi-omics integration for cancer subtype prediction. To evaluate the performance of MOGAT in predicting cancer subtypes, we explored two sets of breast cancer data from TCGA and METABRIC. Our proposed approach, MOGAT, outperforms MOGONET by 32% to 46% and SUPREME by 2% to 16% in cancer subtype prediction in different scenarios, supporting our hypothesis. Our results also showed that GAT embeddings provide a better prognosis in differentiating the high-risk group from the low-risk group than raw features.
more » « less
Full Text Available
Explainable Machine Learning to Identify Patient-specific Biomarkers for Lung Cancer

https://doi.org/10.1109/BIBM55620.2022.9995516

Sobhan, Masrur; Mondal, Ananda Mohan (December 2022, 2022 IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM))

Full Text Available
An Autoencoder Based Bioinformatics Framework for Predicting Prognosis of Breast Cancer Patients

https://doi.org/10.1109/BIBM55620.2022.9995632

Tanvir, Raihanul Bari; Sobhan, Masrur; Mondal, Ananda Mohan (December 2022, 2022 IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM))

Full Text Available
Potential Autoimmunity Resulting from Molecular Mimicry between SARS-CoV-2 Spike and Human Proteins

https://doi.org/10.3390/v14071415

Nunez-Castilla, Janelle; Stebliankin, Vitalii; Baral, Prabin; Balbin, Christian A.; Sobhan, Masrur; Cickovski, Trevor; Mondal, Ananda Mohan; Narasimhan, Giri; Chapagain, Prem; Mathee, Kalai; et al (July 2022, Viruses)

Molecular mimicry between viral antigens and host proteins can produce cross-reacting antibodies leading to autoimmunity. The coronavirus SARS-CoV-2 causes COVID-19, a disease curiously resulting in varied symptoms and outcomes, ranging from asymptomatic to fatal. Autoimmunity due to cross-reacting antibodies resulting from molecular mimicry between viral antigens and host proteins may provide an explanation. Thus, we computationally investigated molecular mimicry between SARS-CoV-2 Spike and known epitopes. We discovered molecular mimicry hotspots in Spike and highlight two examples with tentative high autoimmune potential and implications for understanding COVID-19 complications. We show that a TQLPP motif in Spike and thrombopoietin shares similar antibody binding properties. Antibodies cross-reacting with thrombopoietin may induce thrombocytopenia, a condition observed in COVID-19 patients. Another motif, ELDKY, is shared in multiple human proteins, such as PRKG1 involved in platelet activation and calcium regulation, and tropomyosin, which is linked to cardiac disease. Antibodies cross-reacting with PRKG1 and tropomyosin may cause known COVID-19 complications such as blood-clotting disorders and cardiac disease, respectively. Our findings illuminate COVID-19 pathogenesis and highlight the importance of considering autoimmune potential when developing therapeutic interventions to reduce adverse reactions.
more » « less
Full Text Available
Stage-Specific Co-expression Network Analysis for Cancer Biomarker Discovery

https://doi.org/10.1109/BIBM49941.2020.9313242

Tanvir, Raihanul Bari; Mondal, Ananda Mohan (December 2020, 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))
null (Ed.)
Full Text Available
Multi-Run Concrete Autoencoder to Identify Prognostic lncRNAs for 12 Cancers

https://doi.org/10.3390/ijms222111919

Al Mamun, Abdullah; Tanvir, Raihanul Bari; Sobhan, Masrur; Mathee, Kalai; Narasimhan, Giri; Holt, Gregory E.; Mondal, Ananda Mohan (November 2021, International Journal of Molecular Sciences)

Background: Long non-coding RNA plays a vital role in changing the expression profiles of various target genes that lead to cancer development. Thus, identifying prognostic lncRNAs related to different cancers might help in developing cancer therapy. Method: To discover the critical lncRNAs that can identify the origin of different cancers, we propose the use of the state-of-the-art deep learning algorithm concrete autoencoder (CAE) in an unsupervised setting, which efficiently identifies a subset of the most informative features. However, CAE does not identify reproducible features in different runs due to its stochastic nature. We thus propose a multi-run CAE (mrCAE) to identify a stable set of features to address this issue. The assumption is that a feature appearing in multiple runs carries more meaningful information about the data under consideration. The genome-wide lncRNA expression profiles of 12 different types of cancers, with a total of 4768 samples available in The Cancer Genome Atlas (TCGA), were analyzed to discover the key lncRNAs. The lncRNAs identified by multiple runs of CAE were added to a final list of key lncRNAs that are capable of identifying 12 different cancers. Results: Our results showed that mrCAE performs better in feature selection than single-run CAE, standard autoencoder (AE), and other state-of-the-art feature selection techniques. This study revealed a set of top-ranking 128 lncRNAs that could identify the origin of 12 different cancers with an accuracy of 95%. Survival analysis showed that 76 of 128 lncRNAs have the prognostic capability to differentiate high- and low-risk groups of patients with different cancers. Conclusion: The proposed mrCAE, which selects actual features, outperformed the AE even though it selects the latent or pseudo-features. By selecting actual features instead of pseudo-features, mrCAE can be valuable for precision medicine. The identified prognostic lncRNAs can be further studied to develop therapies for different cancers.
more » « less
Full Text Available
Pan-cancer Feature Selection and Classification Reveals Important Long Non-coding RNAs

https://doi.org/10.1109/BIBM49941.2020.9313332

Mamun, Abdullah Al; Duan, Wenrui; Mondal, Ananda Mohan (December 2020, 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))
null (Ed.)
Full Text Available
Deep Learning to Discover Genomic Signatures for Racial Disparity in Lung Cancer

https://doi.org/10.1109/BIBM49941.2020.9313426

Sobhan, Masrur; Mamun, Abdullah Al; Tanvir, Raihanul Bari; Alfonso, Mario Jacas; Valle, Pablo; Mondal, Ananda Mohan (December 2020, 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records